#### **EXPERIMENT #2**

# A Logic Processor

## I. OBJECTIVE

In this experiment, you will design and build a bit-serial logic operation processor. The design will utilize two 4-bit shift registers, several multiplexers, and some type of counter. The circuit will be capable of calculating eight different functions and routing the results of those operations in four different ways. A finite state machine will be implemented to serve as the control unit of the circuit.

## II. INTRODUCTION

In the experiment on storage, shift registers were used to store data by shifting a bit out of one end of the shift register and then shifting the same bit into the other end. However, when a write operation was performed, a <u>new</u> bit was shifted into the register. Taking this process one step further, we will modify, not one, but all the data bits in the register using a designated logical function. The circuit will provide the capability of bitwise logical operations. We only want to perform the logical function once on each bit of the data, so we will also need to keep the data from circulating more than once through the circuit.

The operations that this circuit will perform are similar to the bit-wise logical functions provided in machine level programming. For example, your circuit will be able to perform a bit-wise AND of the two operands that are stored in the two registers (RegA and RegB) and be able to place the result in either of the registers, leaving the other register unchanged in the process. A bit-wise AND means that each bit of RegA is logically ANDed with the corresponding bit in RegB. So, if RegA =  $a_3a_2a_1a_0$  and RegB =  $b_3b_2b_1b_0$  then the destination register (RegA or RegB) will hold ( $a_3b_3$ ), ( $a_2b_2$ ), ( $a_1b_1$ ), ( $a_0b_0$ ) after the computation is complete. This is equivalent the machine code instruction of the form:

AND R0, R1, R0 /\* R0 and R1 = > R0 \*/

The other functions are OR, XOR, NAND, NOR, XNOR, CLR, SET, and SWAP. The complete set of functions and their corresponding control inputs are tabulated below (Table 1).

The block diagram of the circuit you will design for this experiment is shown below (Fig. 1). It includes

- a register unit that contains two 4-bit registers, which we will refer to as RegA and RegB,
- 2) a computation unit that executes the desired logical computation,
- 3) a **routing unit** that routes the signals back to the register unit after computation, and
- 4) a **control unit** that generates control inputs to the register unit.



Figure 1: Block Diagram

# **Register Unit**

The register unit will be made up of two 4-bit shift registers that will hold the values of Register A (RegA) and Register B (RegB). The contents of these registers should be displayed so that the contents before and after execution can be inspected. The control of these registers will be provided from the Control Unit while the serial input will be provided from the routing unit.

# **Computation Unit**

The computation unit will accept as inputs the contents of RegA and RegB, and the function selection inputs F2, F1, F0. The unit will output the logical function f(A, B) specified by <F2, F1, F0> and will also output the A and B inputs unchanged. The three outputs will be fed to the Routing Unit.

# **Routing Unit**

The routing unit will accept the A, B, and f(A, B) inputs and, based on the routing selection inputs R1, R0, will determine which signals to feed to the A' (new A) and B' (new B) outputs.

**TABLE 1: Functions** 

| Function<br>Selection Inputs |    |    | Computation Unit Output |
|------------------------------|----|----|-------------------------|
| F2                           | F1 | F0 | f(A, B)                 |
| 0                            | 0  | 0  | A AND B                 |
| 0                            | 0  | 1  | A OR B                  |
| 0                            | 1  | 0  | A XOR B                 |
| 0                            | 1  | 1  | 1111                    |
| 1                            | 0  | 0  | A NAND B                |
| 1                            | 0  | 1  | A NOR B                 |
| 1                            | 1  | 0  | A XNOR B                |
| 1                            | 1  | 1  | 0000                    |

| Routing |       |              |            |
|---------|-------|--------------|------------|
| Sele    | ction | Router Outpu |            |
| R1      | R0    | <b>A*</b>    | <b>B</b> * |
| 0       | 0     | A            | В          |
| 0       | 1     | Α            | F          |
| 1       | 0     | F            | В          |
| 1       | 1     | В            | A          |

#### **Control Unit**

The control unit will accept the following inputs: Load A, Load B, Execute, and the clock signal. The Load A and Load B inputs will perform parallel loads from the data input switches (D3-D0) into the A and B registers. Execute tells the control unit that the select switches and the register contents are ready for execution and that the control unit should begin the computation cycle. The control unit then shifts the register unit the required number of times and halts until the next execution is requested. Obviously, some type of mechanism to keep track of the shifts will be required. The clock input should be taken from the function generator to make the computation cycle appear to be instantaneous while also leaving the debugging capacity of single stepping.



Figure 2: State Diagram

To accomplish this, a finite state machine is devised to control the operation of the register unit. There are two common state machine types: the Moore machine and the Mealy machine. During operation, both machines take in a set of inputs, transitions through a finite number of states, and output the relevant controls. The biggest difference between the two state machines is that the outputs of the Moore machine depend solely on the current state, each serving a specific output configuration, while the outputs of the Mealy machine depends on a combination of the current state and the current inputs. From this point of view, it is apparent that the Mealy machine will be able to achieve the same level of control by using fewer states than what's required by the Moore machine, which also makes the circuit implementation a little bit easier.

Table 1 shows an example of the Mealy machine for our control unit. The inputs of the Mealy machine are the 'Execute' switch, a single-bit state representation 'Q', and two-bit count 'C1C0'. The 'Execute' switch dictates when the circuit should initiate the computation cycle. The single-bit state 'Q' split the Mealy machine into two states that serves distinct purposes – one is the reset/rest state, and the other one is the shift/halt state. And the count bits 'C1C0' are used to keep track of the number of shifts in the shift/halt state. Note that 'C1C0' represents a simplification, using a counter here reduces the number of states we must explicitly encode. The outputs of the mealy machine are the output signal 'Reg. Shift' ('S') which goes to the register unit, the next state 'Q', and the next count 'C1C0'. Notice that the state descriptions are not very precise here, and this is the characteristics of the Mealy machine. Unlike the Moore machine where every state is directly linked to an operation and thus the purpose is clearly defined, the Mealy machine groups similar operations into a single state, then uses a combination of the current state and the current inputs to perform an operation.

To produce a state machine, you should follow the actual sequential operation of the circuit, where the state and the counts starts from (QC1C0='0000'), and the 'Execute' switch is held low ('E'='0'). As long as the 'Execute' switch remains low, the circuit stay in a rest state, where the register unit stays put and the next state and count also stay unchanged (SQ+C1+C0+='0000'). However, at the immediate clock edge after the 'Execute' switch is flipped up (EQC1C0='1000'), the state machine moves to the shift/halt state, and sends out the signal to shift the registers and begins to increment the counter (SQ+C1+C0+='1101'). The state machine in total should then carry out three additional shifts regardless of the condition of the 'Execute' switch. After the four shifts, the state machine will state in (SQ+C1+C0+='0100') if the 'Execute' switch remains high, or transitions back to (SQ+C1+C0+='0000') if the 'Execute' switch drops low. This completes one full cycle of bit-serial logic operation as the state machine comes back to where it had started and awaits for another full cycle of operation when the 'Execute' switch is flipped up again.

To transform the state machine into a physical circuit, we will next build a state transition table. Follow through the entire operation cycle to fill out as much of the state transition table as possible. You will then notice that not all combinations are valid. For example, if we are in the reset/rest state, it is not possible for our counts to hold any value other than '00', and therefore we will never encounter, for example, (EQC1C0='0011') during our circuit operation. If the combination is not valid, a 'don't care' ('d') should be

placed in the outputs for easier implementation of the circuit (remember that it is ok to circle the K-map minterm over the 'don't cares' without affecting the functionality of the circuit).

TABLE 1: Control unit state transition table using the Mealy state machine

| Exec.<br>Switch ('E') | Q | C1 | C0 | Reg. Shift<br>('S') | $Q^+$ | C1 <sup>+</sup> | C0 <sup>+</sup> |
|-----------------------|---|----|----|---------------------|-------|-----------------|-----------------|
| 0                     | 0 | 0  | 0  | 0                   | 0     | 0               | 0               |
| 0                     | 0 | 0  | 1  | d                   | d     | d               | D               |
| 0                     | 0 | 1  | 0  | d                   | d     | d               | D               |
| 0                     | 0 | 1  | 1  | d                   | d     | d               | D               |
| 0                     | 1 | 0  | 0  | 0                   | 0     | 0               | 0               |
| 0                     | 1 | 0  | 1  | 1                   | 1     | 1               | 0               |
| 0                     | 1 | 1  | 0  | 1                   | 1     | 1               | 1               |
| 0                     | 1 | 1  | 1  | 1                   | 1     | 0               | 0               |
| 1                     | 0 | 0  | 0  | 1                   | 1     | 0               | 1               |
| 1                     | 0 | 0  | 1  | d                   | d     | d               | D               |
| 1                     | 0 | 1  | 0  | d                   | d     | d               | D               |
| 1                     | 0 | 1  | 1  | d                   | d     | d               | D               |
| 1                     | 1 | 0  | 0  | 0                   | 1     | 0               | 0               |
| 1                     | 1 | 0  | 1  | 1                   | 1     | 1               | 0               |
| 1                     | 1 | 1  | 0  | 1                   | 1     | 1               | 1               |
| 1                     | 1 | 1  | 1  | 1                   | 1     | 0               | 0               |

After the state transition table is made, we then proceed on producing the K-maps and circling the minterms. Since the transitioning of each of the four state machine components (S, Q, C1, C0) is dictated by four inputs (E, Q, C1, C0), you will need to convert the table to four K-maps and obtain the resulting circuits for each of the four components. Notice that while (Q, C1, C0) are internal components of the state machine, the only actual output 'S' will be used to control the shifting of the two registers. To adapt 'S' and 'LoadA', 'LoadB' to the two registers for the shift/load/halt operations, simple combinational logic will need to be devised.

#### **Demo Points Breakdown:**

#### Week 1

1.0 point: Show correct loading of the A and B registers

1.0 point: Show the computation cycle is of the right length

1.0 point: Demonstrate the four routing operations

1.0 point: Demonstrate the eight function operations

2.0 points: Show the computation cycle completes even if the EXECUTE switch is returned to the low position mid-computation (requires a slow clock)

#### Week 2

1.0 point: Functional simulation completed successfully for the 8-bit serial processor (annotations necessary)

1.0 point: RTL block diagram of the 8-bit logic processor extended from 4-bits. This can be automatically generated using Quartus, but you must show your TA you know how to generate this successfully.

2.0 points: Show a SignalTap ILA (Integrated Logic Analyzer) trace of the 8-bit logic processor, performing the same 8'h33 XOR 8'h55 operation as in the testbench. Note that you will likely have to hardcode the values of F and R to have enough switches for Din, Load A, Load B and execute.

## III. PRE-LAB

#### Week 1

Design, document and build the circuit described in Part II. Your circuit should be able to perform correctly all the functions listed. You will want to study each of the chips carefully before deciding on one or the other. Be sure to make your design as efficient as possible (there is more than one way to design this circuit).

A square wave from the Pulse Generator should be used as the basic system clock. Load A, Load B, Execute, D3-D0, R1, R0, and F2-F0 should be inputs from the switches. For working at home, the switches provided with your lab kit as well as the DE10-Lite switchbox may be used. The control unit must be designed to perform the desired function once and only once each time the execute switch is flipped on. Results of the operation should be obtained even if the execute switch is flipped off in the middle of the computation cycle. You may only assume that the execute switch will remain high for at least one full clock period. Display the contents of Register A and Register B on LEDs. You may also want to include an LED that indicates when the computation cycle is complete for debugging purposes.

### Week 2

A. Source code for a SystemVerilog implementation of the 4-bit logic processor has been provided for you. You will create a Quartus Prime project to synthesize, simulate, and extend the 4-bit logic processor to 8-bits. To start, complete the bit-serial logic processor exercise from the Introduction to SystemVerilog and Tutorial (IQT. 1-40). Include a copy of the generated diagram from Quartus of the 8-bit logic processor and the simulation waveform (with annotations) in your Lab 3 lab report and demonstrate you know how to do this to your TA. Additionally, demonstrate the generation of a SignalTap trace showing the operation 8'h33 XOR 8'h55 in your 8-bit logic processor. Due to the limited number of switches on the DE10-Lite FPGA board, you will have to hardwire F and R (perhaps using 'assign' statements) in the SystemVerilog code to perform the appropriate operation. You may then assign the remaining switches to Din, Load A, Load B, and Execute according to the chart at the end of this document to demonstrate the operation.

# IV. <u>LAB</u>

Follow the Lab 2 demo information on the course website.

### V. POST-LAB

Document changes to your design and correct your Pre-Lab write-up, explaining any difficulties you had in debugging your circuit. Outline how the modular approach proposed in the pre-lab help you isolate design and wiring faults, be specific and give examples from your actual lab experience.

Make sure you report discusses the following:

Describe the simplest (two-input one-output) circuit that can optionally invert a signal (i.e., one input determines if the output is equal to the other input or equal to the other input inverted). Explain why this is useful for the construction of this lab.

Explain how a modular design such as that presented above improves testability and cuts down development time.

Discuss the design process of your state machine, what are the tradeoffs of a Mealy machine vs a Moore machine?

What are the differences between ModelSim and SignalTap? Although both systems generate waveforms, what situations might ModelSim be preferred and where might SignalTap be more appropriate?

### VI. REPORT

Write a report, you may follow the provided outline below, or make sure your own report outline includes at least the items enumerated below. Note that your report must be organized into chapters with paragraphs, rather than a bullet list of materials.

#### 1) Introduction

- a. Summarize what high-level function your circuit performs. What operations can the processor do? How many bits can it operate on? Etc. The introduction should be approximately 3 5 sentences.
- 2) Operation of the logic processor
  - a. Describe the sequence of switches the user must flip to load data into the A and B registers.
  - b. Describe the sequence of switches the user must flip to initiate a computation and routing operation.

- 3) Written description, block diagram and state machine diagram of logic processor
  - a. Written description: describe in words each block in the high-level diagram (a short paragraph for at least the register unit, computation unit, routing unit, and control unit).
  - b. Include a high-level block diagram. It is acceptable to use the one in the lab manual, provided it is modified as necessary to reflect what you implemented.
  - c. State Machine Diagram
    - Explicitly state if you used a Mealy or Moore machine (or some hybrid)
    - ii. Label each state (bubble)
      - 1. Give each state a meaningful name.
      - 2. Specify the binary flip-flop values associated with each state.
      - 3. If you are using a Moore Machine, label the values of all meaningful outputs associated with each state.

### iii. Label each arc.

- 1. The combination of inputs which trigger the arc.
- 2. If you are using a Mealy Machine, label the values of all meaningful outputs associated with each arc.
- 3. It is OK to label each bubble/arc with a single identifier and put the rest of the requested info in a table to reduce clutter.
- 4) Design steps taken and detailed circuit schematic diagram.
  - a. Written procedure of the design steps taken.
    - i. If you used k-maps or truth tables during design, include them here. (If you did not need them, you do not need to include them). K-maps are usually only helpful for creating the next state logic in the control unit.
    - ii. Written description of the design considerations taken (did you consider multiple implementations of the same circuit and the tradeoffs of each?)

## b. Detailed Circuit Schematic

- i. Draw a gate level schematic of your circuit. It is OK to use small standard blocks like MUXes, flip-flops, and shift registers, but custom blocks like your control unit must have a gate level schematic.
- ii. If the schematic becomes too large, components such as the control unit can be represented as black boxes on the top-level schematic, and a detailed schematic of that component can be included below.

iii. You <u>must use a CAD tool</u> (either Fritzing or Quartus) for this portion. Note that Quartus has the standard 7400 parts in the schematic editor. Fritzing parts for may be downloaded from the course website.

## 5) Breadboard view / Layout sheet

- a. Use either the layout sheet from the general guide or the Fritzing breadboard view.
  - i. You should have no air-wires, all your wires should be visible though they may be a little bit cluttered as the design is complex.
  - ii. You may use the 'breadboard label' to label the inputs, it is not necessary put all the switches into the breadboard view (you may if you want).
  - iii. Similarly, you may use breadboard label to label the outputs (for the LEDs), you do not need to put in the physical LED strip (you may if you want).
  - iv. If you wish to avoid having too many crossing wires, you may use 'breadboard label' to make virtual connections.
- b. Use multiple breadboards as necessary, your breadboard view should be complete, in the sense that someone should be able to build a working version of the circuit with only your breadboard view.
- c. If using the manual layout sheet from the general guide, follow the rules provided in the general guide.
- 6) 8-bit logic processor on FPGA
  - a. Summary of all .SV modules and the changes you made to extend processor to 8-bits.
  - b. RTL block diagram please only include the top-level design if using the RTL viewer.
  - c. Include a simulation of the processor that has notes (annotations) that give information such as what operation is being performed, where the result was stored, etc.
  - d. Include procedure used to generate SignalTap ILA trace, as well as the result of such trace executing the 8'h33 XOR 8'h55 operation. E.g.: "Step 1, Set the trigger on signal <signal here>..."
- 7) Description of all bugs encountered, and corrective measures taken.
- 8) Conclusion
  - a. Summarize the lab in a few sentences.

b. Answer to all post-lab questions (they may be placed in conclusion or dispersed in more appropriate sections of the report).

## VII. APPENDIX

Module descriptions are an important part of the reports in ECE 385, and since this is the first significant FPGA lab, a brief example of how to write a module description is shown below.

Suppose you needed two 16-bit registers to store operands A and B in an adder that computes the sum of A and B. Here is example code of a 16-bit register with asynchronous reset and synchronous load that can be used for that purpose.

And here is how a section of the report would describe it:

```
Module: reg16.sv

Inputs: [15:0] Din, Clk, Load, Reset

Outputs: [15:0] Dout
```

Description: This is a positive-edge triggered 16-bit register with asynchronous reset and synchronous load. When Load is high, data is loaded from Din into the register on the positive edge of Clk.

Purpose: This module is used to create the registers that store operands A and B in the adder circuit.

Simple modules can have a description and purpose that are just a sentence or two each, but more complicated modules require more detailed descriptions.

### VIII. APPENDIX II

The following are the recommended pin assignments to set up SignalTap for Lab 2.2. Note that because of an insufficient number of inputs once you extend the logic processor to 8 bits, you will still need to hard wire the F and R appropriately (using assign statements) to do the operation specified in the demo points. This is broken up into two sections. The first section consists of the minimal inputs required to do the operation, while the second section has additional LEDs you might wish to use for debugging.

**Pin Assignment Table (Minimal Inputs)** 

| im issignment tuble (illiminal inputs) |          |                                            |  |  |
|----------------------------------------|----------|--------------------------------------------|--|--|
| Port Name                              | Location | Comments                                   |  |  |
| Din[0]                                 | PIN_C10  | On-board slider switch (SW0)               |  |  |
| Din[1]                                 | PIN_C11  | On-board slider switch (SW1)               |  |  |
| Din[2]                                 | PIN_D12  | On-board slider switch (SW2)               |  |  |
| Din[3]                                 | PIN_C12  | On-board slider switch (SW3)               |  |  |
| Din[4]                                 | PIN_A12  | On-board slider switch (SW4)               |  |  |
| Din[5]                                 | PIN_B12  | On-board slider switch (SW5)               |  |  |
| Din[6]                                 | PIN_A13  | On-board slider switch (SW6)               |  |  |
| Din[7]                                 | PIN_A14  | On-board slider switch (SW7)               |  |  |
| LoadB                                  | PIN_B14  | On-board slider switch (SW8)               |  |  |
| LoadA                                  | PIN_F15  | On-board slider switch (SW9)               |  |  |
| Clk                                    | PIN_P11  | 50 MHz Clock from the on-board oscillators |  |  |
| Reset                                  | PIN_B8   | On-Board Push Button (KEY0)                |  |  |
| Execute                                | PIN_A7   | On-Board Push Button (KEY1)                |  |  |

The following second section of pin assignments maps the HEX drivers to the various segments of the FPGA board's 7-segment displays. This is not necessary for SignalTap operation, as the values of RegA and RegB may be captured and displayed via SignalTap even if the LEDs are not mapped. However, your LEDs and 7-segment displays will display scrambled data if these pin assignments are not mapped.

Pin Assignment Table (LEDs/HEX Displays)

| Port Name | Location | Comments                               |
|-----------|----------|----------------------------------------|
| HexBL[0]  | PIN_C14  | On-Board seven-segment display segment |
| HexBL[1]  | PIN_E15  | On-Board seven-segment display segment |
| HexBL[2]  | PIN_C15  | On-Board seven-segment display segment |
| HexBL[3]  | PIN_C16  | On-Board seven-segment display segment |
| HexBL[4]  | PIN_E16  | On-Board seven-segment display segment |
| HexBL[5]  | PIN_D17  | On-Board seven-segment display segment |
| HexBL[6]  | PIN_C17  | On-Board seven-segment display segment |
| HexBL[7]  | PIN_D15  | On-Board seven-segment display segment |

| HexBU[0] | PIN_C18 | On-Board seven-segment display segment |
|----------|---------|----------------------------------------|
| HexBU[1] | PIN_D18 | On-Board seven-segment display segment |
| HexBU[2] | PIN_E18 | On-Board seven-segment display segment |
| HexBU[3] | PIN_B16 | On-Board seven-segment display segment |
| HexBU[4] | PIN_A17 | On-Board seven-segment display segment |
| HexBU[5] | PIN_A18 | On-Board seven-segment display segment |
| HexBU[6] | PIN_B17 | On-Board seven-segment display segment |
| HexBU[7] | PIN_A16 | On-Board seven-segment display segment |
| HexAL[0] | PIN_B20 | On-Board seven-segment display segment |
| HexAL[1] | PIN_A20 | On-Board seven-segment display segment |
| HexAL[2] | PIN_B19 | On-Board seven-segment display segment |
| HexAL[3] | PIN_A21 | On-Board seven-segment display segment |
| HexAL[4] | PIN_B21 | On-Board seven-segment display segment |
| HexAL[5] | PIN_C22 | On-Board seven-segment display segment |
| HexAL[6] | PIN_B22 | On-Board seven-segment display segment |
| HexAL[7] | PIN_A19 | On-Board seven-segment display segment |
| HexAU[0] | PIN_F21 | On-Board seven-segment display segment |
| HexAU[1] | PIN_E22 | On-Board seven-segment display segment |
| HexAU[2] | PIN_E21 | On-Board seven-segment display segment |
| HexAU[3] | PIN_C19 | On-Board seven-segment display segment |
| HexAU[4] | PIN_C20 | On-Board seven-segment display segment |
| HexAU[5] | PIN_D19 | On-Board seven-segment display segment |
| HexAU[6] | PIN_E17 | On-Board seven-segment display segment |
| HexAU[7] | PIN_D22 | On-Board seven-segment display segment |